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Abstract 


We  study  the  family  of  power  transformations  proposed  by  Box 

and  Cox  (1964)  when  the  choice  of  tin1  power  parameter  X  is  restricted 

to  a  finite  set  ft  .  The  two  cases  in  which  obvious  answers  obtain 
K 

arc  when  the  true  parameter  X  is  an  element  of  fi^  and  when  X  is 
"far"  from  ft.,  .  We  study  the  case  in  which  X  is  ’’close"  to  ft„  , 
finding  that  the  resulting  methods  can  he  very  different  from  un¬ 
restricted  maximum  likelihood  and  that  inference  depends  on  the  design, 
the  values  of  the  regression  parameters,  and  the  distance  of  X  to 
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I.  Introduction 

I’.ox  and  Cox  (1'JoA)  suggest od  the  power  family  of  transformations, 
wherein  for  some  unknown  A  , 

(1.1)  y  ^  ~  x .  6  +  ar  .  -  t  .+  oc  .  ,  i  «*!,...  ,N  . 

1  l  li  l 

i 

Here  the  design  vectors  x .  =  (  I  •••  cip^  *  ^  =  (PQ  •••  6  j)  , 

the  arc  independent  and  identically  distributed  with  mean  zero, 

variance  one  and  distribution  F  ,  and 

y(A)  =  (y(A)_  ,)/x  X  *  0 

=  lop.  y  A  =  0  . 

They  studied  both  maximum  likelihood  and  Bayes  inference  when  F  is 
the  normal  distribution.  There  is  now  a  substantial  literature  on 
the  problem,  an  incomplete  list  of  which  includes  Andrews  (1971), 
Atkinson  (1973),  llinkley  (1975),  Birkel  and  Doksum  (1980,  denoted  B-D) , 
Carroll  (1980),  and  Carroll  and  Ruppert  (1980,  denoted  C-R)  . 

B-D  developed  an  asymptotic  theory  for  estimation.  If  the  normal 

A  *  f  A  A 

theory  MLE  is  f,  when  A  is  known  and  6  *  6(A)  when  A  is 
unknown  and  estimated  by  A  ,  they  compute  the  asymptotic  distributions 
of  N*^(0  -  6)/"  and  N*^(0*  -  3)/o  as  N  ■*  “,  o  0  .  These 
distributions  arc  different,  with  the  latter  having  a  covariance 
matrix  at  least  as  large  and  often  very  much  larger  than  that  of  the 
former;  the  estimates  A  and  8*  are  highly  variable  and  highly 
correlated  in  general.  This  suggests  that  there  is  a  large  "cost"  due 
to  estimating  the  power  parameter  A  .  Unfortunately,  these  results 
(and  independent  Monte-Carlo  work  by  Carroll  (1980))  suggest  that 
unconditional  inference  concerning  6  can  be  very  difficult  for, 
except  in  certain  balanced  designs,  inference  without  taking 

into  account  the  variability  of  A  will  be  incorrect  while  0  is 
itself  too  variable  to  be  much  help.  A  theory  for  conditional  inference 
might  prove  useful . 

It  is  relevant  to  note  that  when  6*0  and  o*l  are  known,  the 

2 

curvature  (Efron  (1975))  for  A  at  A*0  is  y  -  10.67;  F.fron  suggests 

2  ° 
that  a  value  y  >  1/8  is  "large"! 
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C-R  study  llio  prediction  problem  in  the  sense  of  estimating 
the  conditional  median  of  Y  given  a  design  point  Xo  ;  this  is 
inference  in  the  original  scale  of  the  data.  Their  results  are  much 
more  encouraging;  while  there  is  a  cost  due  to  estimating  X  ,  it 
is  generally  not  severe.  For  example,  if  B  *  (Bq  Bj  ...  B  j)  , 
the  cost  averaged  over  the  distribution  of  the  design  is 
(!+p)  '  (asymptotically  as  N  ►  °»  and  a  -*•  0) . 

We  arc  concerned  with  the  following  point  which  has  been  raised 

concerning  the  applicability  of  the  H— 1>  and  C-R  theories.  In  practice, 

one  may  be  uncomfortable  using  an  estimate  such  as  X  =  .037,  then 

the  much  more  common  log  scale  (X=0)  is  "just  as  good".  Thus  it 

is  reasonable  to  restrict  the  estimate  of  X  to  a  finite  set  fiR 

and  to  study  the  consequences  of  such  a  decision.  Asymptotically, 

ns  N  «>  but  X  and  fi  stay  fixed,  one  has  the  trivial  results 

K 

that  if  X  C  one  is  almost  always  in  the  right  scale  so  there 
arc  no  difficulties,  while  if  X  ^  flR  bias  dominates  and  no  useful 
results  are  obtainable.. 

In  Table  1  we  present  the  results  of  a  Monte-Carlo  study  for 

estimating  the  conditional  median  of  Y  given  x  .  The  model  is 

o 

simple  linear  regression  based  on  a  uniform  design  with  Bo*5,  Bj“2 
and 

-1  N  * 

(1.2)  N  l  x.  x.  -  J2  . 

i  =  l 

/ 

2 

The  errors  were  normally  distributed  with  mean  0  and  variance  a  , 
and  there  were  500  replications  of  the  experiment.  The  restricted 
power  set  was  fl  ■  {0,  ±  1/2,  t  1}  ,  and  we  made  decisions  in  this 
set  on  the  basis  of  the  likelihood.  For  a  given  X  ,  our  estimator  is 

(1  ♦  X  x  d  ♦  0) 

(1.3)  °  *  „ 

exp(  xq  6  )  I  (X  -  0) 

The  numbers  listed  in  the  Table  1  arc  the  "relative  mean  square 
errors  (MSE)",  i.e.,  the  mean  square  error  of  (1.3)  divided  by  the 
MSE  when  X  is  known.  We  list  results  for  the  or i gen  xq  ■  (1  o)  and 
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when  xq  is  n  randomly  chosen  number  of  I  lie  design;  the  latter  is 
in  effect  an  average  relative  MSE  over  tlie  distribution  of  the  design. 

In  Tabic  I  we  see  that  the  restricted  estimator  (RE)  dominates 

the  MI.K  when  A-0  (hence  -1  C  fl  )  ,  whi  le  the  MLK  dominates  when 

K 

A  (J.  .  In  tliis  latter  ease  note  that  increasing  N  or  decreasing  0 

results  in  improved  performance  of  the  MI.E  relative  to  the  RE. 

In  Table  2  we  repeat  the  above  experiment  with  the  changes 
Fo«7,  E|  =  4  .  Tlie  slightly  worse  behavior  of  the  MLK  relative  to  the 
A-known  case  is  expected  from  the  C-R  theory.  Note  here  that  the 
change  in  parameter  values  causes  the  RF.  to  be  much  worse  than  the 

MLK  if  \  {  U  .  M  so,  the  effect  of  changing  N  or  o  is 

K 

highlighted . 

From  the  Monte-Carlo,  we  see  that  the  performance  of  the  RE 
relative  to  the  MLK  depends  on  A,  N,  0  and  B  .  One  purpose  of 
the  rest  of  this  paper  is  to  propose  and  investigate  a  simple 
theory  which  gives  a  somewhat  more  systematic  understanding  of  this 
performance.  More  generally,  we  also  investigate  the  question  of 
the  feasibility  of  constructing  procedures  for  which  the  choice 
of  A  is  restricted  but  which  also  give  performance  comparable 
to  the  MLE. 
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Table  1 


The  MSK  holi.iv  i  nr  of  tbe  MU'.  .iml  III!  relative  to  tin-  ^-known 


islii  itc  oi  the  median  of  Y  i.iven 


Hr  re 


n.,  ■■  (o,  i  i/2,  •  l ),  ('■  ■-  b 

I.  o 


p  a  2 

I 


0U1GI.N  AVKRAGR 

(x  =  (01))  (x  random  member  of  design) 

o  o 


Rat  i o 


N 

0 

\ 

Ml.l’ 

RE 

(lltgh/I.ow) 

MLE 

RE 

RATIO 

20 

1 

( 

5 

1.17 

1.05 

1.11 

1  .07 

1 .01 

1.06 

20 

1/2 

1.28 

1.18 

1 .08 

1.17 

1.01 

1.16 

40 

1 

1.13 

1 .00 

1.13 

1 .08 

1.00 

•  1.08 

40 

1/2 

1 

D 

1.22 

1 .00 

1 .22 

1.16 

1.00 

1.16 

20 

1 

1 

/8 

1.13 

1.20 

1.12 

1  .07 

1.15 

1.07 

20 

1/2 

1  .  ?r> 

1 .43 

1.14 

1.13 

1.24 

1 . 10 

40 

1 

1.12 

1  .29 

1 .15 

1.03 

1 . 10 

1 .07 

40 

1/2 

1 

/3 

1.23 

1.53 

1.22 

1 .05 

1.24 

1 .18 

20 

1 

1  /  4 

1.12 

1.17 

1 .04 

1 .06 

1.10 

1 .04 

20 

1/2 

1.24 

1.34 

1 .08 

1.12 

1.24 

1 . 1 1 

40 

1 

1.10 

1.29 

1 .17 

1 .04 

1 . 10 

1.06 

40 

1/2 

1 

/A 

1.23 

1.52 

1.24 

1 .06 

1.14 

1.08 
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T.i1)l  c  2 


l'iic  MSB  behavior  of  die  MLE  and  RE  relative  to  the  X-known 
estimate  of  the  comlition.il  median  of  V  given  xq  .  Here 
VR  “  (0,  1  1/2,  i  1),  B(i  -  7  and  P(  =  4  . 


OKI GEN  AVERAGE 

(x  ^  =  (0  1))  (x^  random  member  of  design) 

Ratio 


n 

0 

X 

MLB 

RE 

(lligh/l.ow) 

MLE 

RE 

RATIO 

20 

1 

( 

) 

1.34 

1 .00 

1.34 

1.25 

1 .00 

1.25 

20 

1/2 

1.43 

1.00 

1.43 

1.27 

1  .00 

1.27 

AO 

1 

1  .23 

t  .00 

1.23 

1.32 

1.00 

1.32 

40 

1/2 

( 

) 

1.37 

1 

1 .00 

1.37 

1.27 

1 .00 

1.27 

20 

l 

1/8 

1.25 

1.72 

1.38 

1.15 

1.61 

1.40 

20 

1/2 

1.37 

2.24 

1.64 

1 . 18 

2.30 

1.95 

40 

I 

• 

1.24 

1.80 

1.45 

1 .04 

1 .59 

1.53 

40 

1/2 

1, 

/8 

1.38 

2.47 

1 .79 

1.07 

2.33 

2.18 

20 

1 

1/4 

1.24 

1.65 

1.33 

1 .  13 

1.49 

1.32 

20 

1/2 

: 

i 

1.36 

2.52 

1.85 

1.16 

2.19 

1.89 

40 

1 

1.22 

2 .  It 

1.73 

1.05 

1.42 

1.35 

40 

1/2 

1 

/4 

1.36 

3.41 

2.51 

1.09 

2.20 

2.02 

I 


I 

i 

* 
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2.  A_  large  sample  t lioory 

Any  reasonable  theory  must  have  X  "close"  to  for  large 

sample  sizes.  We  choose  to  do  this  by  letting  the  cardinality  of 

ftjj  increase  with  increasing  sample  size  N  and  by  letting  X  «*  X^ 

converge  to  a  fixed  element  of  SI  .  For  ease  of  calculation  we 

focus  on  the  important  special  case  that  the  log  scale  is  "almost" 

correct,  i.e.,  ft  always  contains  zero  and 
K 


X  =  ba/N 


1/2 


Of  course,  when  1>  =  0  the  data  truely  have  a  log-normal  distribution. 

Let  Xu  and  X  denote  the  restricted  and  ML  estimates  of  X  , 

K  M 

let  P  or  P  be  the  estimate  of  p  having  chosen  the  power 

*  "  A 

X  or  X  ,  and  let 
K  M 

f  (X ,  XoB)  -  (1  +  X  xo  p)'A  (X  +  0) 

“  exp  (  xe  P)  (X  *  0)  , 

which  is  the  condi t ion ioual  median  of  Y  given  x^  ,  with  estimate  (1.2). 
We  assume  the  errors  are  normally  distributed.  Letting  e  =  (I  o  ...  o) , 
we  assume 


x.  e'  «•  I  (there  is  an  intercept) 

N  1  T  x.'  x.  e  I  . 

‘•LI 

Then,  for  any  value  of  b  ,  when  X  is  known  the  limit  MSE  is 

(2.2)  MSE(X  known)  =  jjx^J^  exp (2  xq  6)  . 

For  fixed  o  the  computations  are  very  difficult,  so  we  will 
follow  the  lead  of  Bickcl  and  Doksum  and  consider  only  the  case  that 
o  ■  T  n  ,  where  T  *=  T(N)  >0  is  a  known  sequence;  it  simplifies 
notation  to  make  the  convention  n  ■  l  • 
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Wo  arc  now  in  a  position  to  define  the  restricted  estimate 

A„  of  X  ,  which  wo  take  hy  convention  to  satisfy  |X  |  <  1  . 

K  K  — 

Let  V  **  { }  be  a  finite  or  countably  infinite  subset  of  the 
extended  real  line  with  d^  =  0  ,  d_^  *=  -  d^  »  and  sup{d^}  -  •  . 
Define  intervals  midway  between  these  points: 

\  •  <vw/:)  • 

A  A 

Our  restricted  estimate  A„  satisfies  |A  |  1  and  maximizes 

*'  i  /2  ~ 

the  likelihood  over  the  admissible  set  with  N  XD/P  G  V  . 

K 

Asymptotically,  the  procedure  becomes 

Choose  Nl/2X,,/r  -  d.  if  Nl/2A  /r  G  B  and  |  A  J  <  I  . 
If  not  possible,  choose  XR  =  ±1  on  the  basis  of  the 
likelihood. 

The  resulting  estimate  of  B  is  &  and  the  estimate  of  the 

K  A  A 

conditional  median  of  Y  given  x  is  f ( X  ,x  B  )  . 

O  K  O  K 

The  above  procedure  is  asymptotically  the  same  as  a  restricted 

maximum  likelihood  method  and  is  quite  intuitive  as  it  choses  the 

1  /2A 

point  in  V  closest  to  N  X  /T  .  NoLc  also  that  as  N  increases, 

M 

the  number  of  possible  choices  for  scale  also  increases,  as 
desired.  Make  the  definitions: 

-I  _  N  2 

qN  -  N  Z ,  T.  x.  ►  q 

al  “  [*o  q'  “  (x08)2)/2 
CN  "  • • •  -t2/2) 

h  m  <X|  ■■•  *«  > 

eN  -  (l/4){N  '  ^  t*  -  hN|2}  **•  eD  >  0  * 

Theorem.  Using  the  B-D  asymptotics,  the  limit  distribution  of  the 
restricted  estimator  of  the  conditional  median 


(2.1) 
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N1/2  f(A..,x  P„)  -  f(A  -  bo/Kl/?fx  P 

k  (I  K  t' 

is  j-.ivi.il  by 

(2.2)  cxp(x  P)  Jx  ♦  a,  J  (dk-l>)I(c‘l/2Z2  +  b  €  Bk)|  . 

whore  and  Z.(  are  independent  standard  normal  random  variables. 

The  proof  is  in  the  appendix. 

The  Theorem  sh. ws  that  the  estimate  of  the  conditional  median 

of  Y  given  x  based  on  a  restricted  choice  of  X  is  not 
o 

necessarily  asymptotically  normally  distributed. 

Fxampl e_"^ 1 .  Suppose  that  for  any  sample  size  we  restrict  our  choice 

of  A  to  a  fixed  set,  say 
K 

flt.  »  {0,  t  1/2,  i  1)  . 

k\ 

In  (his  case  we  eventually  have  X  *=  0  so  that  V  «=  (0,  ♦  »)  and 

R 

(2.3)  MSK  (fixed  finite  set) 

-*■  exp (2  xq  P)  [”  Jxo|2  +  b2a2J 

In  simple  linear  regression  with  a  symmetric  design  and  fourth  moment 

.  2.  4 

satisfying  (1.2),  we  find  that  at  the  origen  =  ( 1  o) ,  a j *=B j  / 4  and 

e  =  P ^  (w,-l)/4  .  In  this  case,  while  (2.3)  does  not  serve  as  a 
o  I  s 

very  good  method  for  predictinj;  the  individual  values  in  Tables  1 
and  2,  it  does,  however,  lead  to  the  following  qualitative  conclusions, 
all  of  which  are  satisfied  by  the  sitmilnt iovs : 

(i)  Changing  the  value  of  N  from  20  to  40  while  fixing 

fl  and  X  basically  increase  b  by  a  factor  of  v'T. 

R  f 

Hence,  larger  values  of  N  will  result  in  a  worse 
performance  for  the  RE  when  X  t|.  fl^  . 

(ii)  Changing  o  from  1  to  1/2  increases  b  by  a  factor 
of  2  and  should  result  in  worse  performance  for  the  RE. 


y 


(iii)  Changing  t' ^  front  2  to  4  increases  t  lie  term  by 

n  factor  of  sixteen  Such  a  l.nvc  chance  should  cause 
much  worst'  pi  rform.ince  in  T.ible  2. 

(iv)  The  increase  in  (iii)  above  should  make  the  changes  in  the  RE 
when  one  changes  N  or  o  much  more  dramatic  in  Table  2  than  in 
Table  1. 

Exnmpl e  ^  2 .  The  theory  includes  the  MI.K  X  by  choosing  l>  dense. 

In  this  rase  we  get 


(2.4a)  USE  (Mi.E  of  X) 


-*■  cxp(2x  R) 
o 


l*J2  +  a  ,2 


In  the  simple  linear  regression,  at  the  origen  this  becomes 


r  -,1 

(2.4b)  exP(2Po)ji  *  (uA-l) 

Note  that  (2.4b)  is  independent  of  the  value  of  b  . 


Example  ^3.  An  interesting  example  in  which  the  number  of  possible 

values  of  X  increases  with  N  occurs  when  V  *  (all  integers}  . 

It  is  not  too  uni easonab 1 c  to  suspect  that  this  restricted  estimate 

will  be  at  least  comparable  to  the  MLK,  perhaps  somewhat  better 

when  b=0  and  hence  X  f  E  ,  but  not  too  much  worse  when  b  ■  1 /2 

K 

and  X  ^  flR  .  In  this  case 
(2.5)  MSE  (restricted  procedure) 
exp(2xQ6) 


Tnc  only  important  difference  be  tween  (2 . 4a)  and  (2.5)  is  the  term 

7  (k-b)?e  l>(e"’  /2Z  +  b  C  R,  1  . 

“  olo  k 1 


II  ^  2 

+  (a  2  e~ 1 )  T  (k-b)2  c  P{e',/2Z  +  b  e  B.  } 
log  o  1  o  k' 

k 


V 
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Iii  Table  3  we  compare  t  ho  values  of  (2.41>)nnd  (2.5)  for  Hu*  uniform 
simple  1  incur  regression  design  of  the  introduction  with  *  1.79; 
nil  comparisons  me  at  (lie  origen  x  =  (I  o) . 


Table _3 


Comparison 

of  MS",  for 

a  simple  lineal 

regression  design  with  moments 

=  »*3  -  0 

.  \‘2  K  1  , 

h^l.79,  xw 

(1  o) 

b 

_ MSF._(rjdO_ 

MSK ( X  known) 

MSK (RE  ) 

MSK (X  known) 

MSE  (RE  ) 
MSl-(MLE) 

0 

1.3 

2.27 

2.37 

1 .04 

0 

2.0 

2.27 

2.59 

1 .14 

0 

4.0 

2.27 

I  .03 

.45 

1/2 

1.5 

2.27 

2.37 

1.04 

1/2 

2.0 

2.27 

2.61 

1.15 

1/2 

4.0 

2.27 

129-42 

57.01 

The  results  are  somewhat  surprising.  First  note  that  the 
case  b = 0  corresponds  to  situations  in  which  X  truely  belongs 
to  the  set  0^  .  The  restricted  estimate  does  not  always  outperform 
the  MLE,  although  it  does  for  large  Rj  .  What  is  even  more  inter¬ 
esting  is  the  case  b=  1/2  ,  which  is  one  of  the  simplest  cases 
in  which  X  is  not  in  the  set  ft  although  it  is  quite  close. 

Here  we  see  that  the  restricted  procedure  can  perform  very  badly 
indeed . 

Tables  1-3  and  the  Theorem  thus  suggest  that  if  the  number  of 
*  .  1  /o 

possible  choices1  of  scale  is  only  on  the  order  of  N1'  ,  the  per¬ 

formance  of  the  resulting  estimates  will  differ  from  estimates  based 
on  the  MLE  of  X  ,  in  some  cases  being  better  but  in  others  being 
very  much  worse.  If  one  has  no  prior  belief  or  evidence  that  only 
a  finite  number  of  values  of  X  arc  possible,  but  rather  in 
estimating  the  conditional  median  of  Y  given  x  one  wants  to 
make  only  "reasonable"  choicer,  of  X  while  retaining  MLF.-type 
behavior,  the  number  of  possible  choices  of  X  will  have  to  be 


v 


■  H 

-II- 
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AppcndjjX 

Wc  will  use-  cent  igui  ty  techniques  (llajok  and  Sidak  (1978)).  Let 
Lj  he  the  1  og- 1  i kol i hood  when  b-0  and  let  be  the  log-likelihood 

for  fixed  b  ♦  0  .  Somewhat  detailed  calculations  show  that  as 
N  "  ,  a  ■*  0,  under  the  distribution  Lj  with  X  =  0  , 

,  N  A  N 

(A.l)  -  <I-2~1  , )  -  (b  /8N)  l  t 7  *  (bN  ')  l  if  /2  +  op(l)  . 

This  shows  that  the  case  h  *  0  is  contiguous  to  the  case  b = 0  . 


l*i  oof  of  the  Theorem:  When  X  ■  b  *•  0  it  follows  by  a  Taylor  expansion 


n  V  tli.it  as  N  ►  >  ,  o  >  0 


:'N  •  N'/J  «VV  -  ‘<A 


=0,8)  exp (_x  8) 
J 


■=  N  )  x  xr.  ♦  a,N  X  /o  +  o  (1)  . 

.  ,  o  i  j  1  R'  p 

l*  I  * 

Also,  B-0  show  that  when  X  «  0  , 

(A. 3)  2o  H]/?y/o  *=  N_1/2  V  (t  ?  -x.  q  ' )  e .  ♦  0  (1)  . 

o  M  ‘  l  i  '  l  p 

It  is  easy  to  check  that  the  r.h.s.  of  (A. 3)  is  asymptotically  independent 
of  the  first  term  >n  the  r.h.s.  of  (A. 2).  We  now  use  the  definition  of 
XR  and  the  convention  o  «  r(N)n  ■  r(N)  to  obtain  that  when  X  •  0  , 
as  N  -►  «  and  o  -»  0  , 

(A. 4)  SN  -  N~l/2  |  xox!e.  ♦  a  \  A  I(N,/2X  /o  €  BR)  ♦  o  (!)  . 

i=l  k  v 

We  arc  now  in  a  position  to  use  Theorem  7.2  of  Roussas  (1972,  page  38). 

In  his  notation. 


T  *  N  *  \  (x  x!e.  q  x!c.  t.2e.) 

N  *'  'nil  1  ll  *  t 


li  11 


(A. 5) 


T  -  F.  T  T 
N  N 


h  -  (0 


0 


-b/2)  . 


-  n  - 

One  can  show  that  the  terms  in  (A. 5)  satisfy  the  conditions  of  Rotissas' 

-J  /  2 

Theorem  7.2  so  that  when  X  ■  b  oN  ,  as  N  -*•  «•  and  o  -*•  0,  Tu  is 

N 

asymptotically  normally  distributed  with  mean  Th  and  covariance  T  . 

1/2  * 

Because  of  (A. 3),  this  means  that  N  “  X  Jo  and  the  first  term  on  the 

-j/2  "  , 

r.h.s.  of  (A. 2)  arc,  when  X  =  b  a  N  ,  jointly  asymptotically  normally 

_  I  2 

distributed  with  mean:;  (h  -bx  q/2)  ,  variances  (e  ,  ljx  !!  )  and  zero 

o  o  "  o 

covariance.  From  this  we  obtain  that  (2.1)  is  asymptotically  distributed 
with  the  same  distribution  as 

-  b‘V“,  l  V(%/2Vb  e  V  - 

k 

where  Zj  and  7.^  are  as  in  the  Theorem.  This  completes  the  proof. 

□ 


